
Learning Robust Options by Conditional Value at Risk Optimization

Neural Information Processing Systems

Options are generally learned by using an inaccurate environment model (or simulator), which contains uncertain model parameters. While there are several methods to learn options that are robust against the uncertainty of model parameters, these methods only consider either the worst case or the average (ordinary) case for learning options. This limited consideration of the cases often produces options that do not work well in the unconsidered case. In this paper, we propose a conditional value at risk (CVaR)-based method to learn options that work well in both the average and worst cases. We extend the CVaR-based policy gradient method proposed by Chow and Ghavamzadeh (2014) to deal with robust Markov decision processes and then apply the extended method to learning robust options. We conduct experiments to evaluate our method in multi-joint robot control tasks (HopperIceBlock, Half-Cheetah, and Walker2D). Experimental results show that our method produces options that 1) give better worst-case performance than the options learned only to minimize the average-case loss, and 2) give better average-case performance than the options learned only to minimize the worst-case loss.
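The average/worst-case trade-off described in this abstract can be sketched numerically. The following is a minimal illustration, not the paper's algorithm: it estimates the empirical CVaR of per-episode losses and forms a weighted combination of the mean (average-case) loss and the CVaR (worst-case) loss. The confidence level `alpha` and the mixing weight `lam` are illustrative values, not values taken from the paper.

```python
import numpy as np

def cvar(losses, alpha=0.9):
    """Empirical CVaR_alpha: mean of the worst (1 - alpha) fraction of losses."""
    var = np.quantile(losses, alpha)       # value at risk (the alpha-quantile)
    return losses[losses >= var].mean()

def mixed_objective(losses, alpha=0.9, lam=0.5):
    """Trade-off between average-case loss and worst-case (CVaR) loss.

    lam and alpha are illustrative hyperparameters, not values from the paper.
    """
    return lam * losses.mean() + (1.0 - lam) * cvar(losses, alpha)

rng = np.random.default_rng(0)
losses = rng.normal(1.0, 0.3, size=10_000)  # stand-in for per-episode losses
print(mixed_objective(losses))
```

Minimizing only the first term recovers the average-case objective; minimizing only the second recovers a worst-case (tail) objective.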


PAC-Bayesian Bound for the Conditional Value at Risk

Neural Information Processing Systems

Conditional Value at Risk ($\textsc{CVaR}$) is a ``coherent risk measure'' which generalizes expectation (reduced to a boundary parameter setting). Widely used in mathematical finance, it is garnering increasing interest in machine learning as an alternate approach to regularization, and as a means for ensuring fairness. This paper presents a generalization bound for learning algorithms that minimize the $\textsc{CVaR}$ of the empirical loss. The bound is of PAC-Bayesian type and is guaranteed to be small when the empirical $\textsc{CVaR}$ is small. We achieve this by reducing the problem of estimating $\textsc{CVaR}$ to that of merely estimating an expectation. This then enables us, as a by-product, to obtain concentration inequalities for $\textsc{CVaR}$ even when the random variable in question is unbounded.
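The reduction of CVaR estimation to estimating an expectation can be illustrated with the standard Rockafellar-Uryasev identity, CVaR_alpha(X) = min over c of { c + E[(X - c)_+] / (1 - alpha) }, whose minimizer is the alpha-quantile (the VaR). The sketch below compares the direct tail-average definition with this variational form on samples; it illustrates the identity only, not the paper's bound.

```python
import numpy as np

def cvar_direct(x, alpha):
    """CVaR as the mean of the worst (1 - alpha) tail of the samples."""
    var = np.quantile(x, alpha)
    return x[x >= var].mean()

def cvar_rockafellar(x, alpha):
    """CVaR via c + E[(x - c)_+] / (1 - alpha).

    On samples, an (approximate) minimizer c is the empirical alpha-quantile,
    so CVaR becomes a plain expectation of the clipped excess loss.
    """
    c = np.quantile(x, alpha)
    return c + np.maximum(x - c, 0.0).mean() / (1.0 - alpha)
```

The second form is what makes concentration arguments tractable: once `c` is fixed, the quantity inside the expectation is an ordinary random variable.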


PAC-Bayesian Bound for the Conditional Value at Risk

Neural Information Processing Systems

The goal in statistical learning is to learn hypotheses that generalize well, which is typically formalized by seeking to minimize the expected risk associated with a given loss function.



Review for NeurIPS paper: PAC-Bayesian Bound for the Conditional Value at Risk

Neural Information Processing Systems

Conditional Value at Risk, or Expected Shortfall (CVaR), of a random variable is the expected value of that variable conditioned on it exceeding a given threshold. For example, it quantifies the amount of tail risk an investment portfolio carries. This kind of quantity is important in many situations and is receiving growing attention in the ML community. Indeed, a learned predictor with poor average accuracy might nevertheless be of high utility if it achieves a good CVaR, provided there is a particular interest in the examples in the best quantile (e.g., the best drivers for car insurance companies ... the only ones that should qualify for a reduction of their insurance quotes). There is still a lot to understand about CVaR from the learning-theory point of view; this paper proposes the first known PAC-Bayesian bound for CVaR.
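The reviewer's point, that average loss can hide tail behavior, can be made concrete. In the sketch below, two hypothetical predictors have (essentially) identical mean loss but very different tail loss under a CVaR-style measure; all numbers are illustrative.

```python
import numpy as np

def tail_mean(losses, alpha=0.9):
    """Mean loss over the worst (1 - alpha) fraction of examples (empirical CVaR)."""
    var = np.quantile(losses, alpha)
    return losses[losses >= var].mean()

# Two hypothetical predictors' per-example losses.
a = np.full(100, 0.5)                     # uniformly mediocre
b = np.concatenate([np.full(90, 0.3),     # good on most examples...
                    np.full(10, 2.3)])    # ...bad on a 10% tail

print(a.mean(), b.mean())                 # averages are essentially equal
print(tail_mean(a), tail_mean(b))         # tails differ sharply
```

An expected-risk criterion cannot distinguish `a` from `b`; a CVaR criterion does, which is the motivation for studying its generalization behavior.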



Minimax Optimal Algorithms for Unconstrained Linear Optimization
H. Brendan McMahan, Jacob Abernethy

Neural Information Processing Systems

We design and analyze minimax-optimal algorithms for online linear optimization games where the player's choice is unconstrained. The player strives to minimize regret, the difference between his loss and the loss of a post-hoc benchmark strategy. While the standard benchmark is the loss of the best strategy chosen from a bounded comparator set, we consider a very broad range of benchmark functions. The problem is cast as a sequential multi-stage zero-sum game, and we give a thorough analysis of the minimax behavior of the game, providing characterizations for the value of the game, as well as both the player's and the adversary's optimal strategy. We show how these objects can be computed efficiently under certain circumstances, and by selecting an appropriate benchmark, we construct a novel hedging strategy for an unconstrained betting game.
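For context, a plain online-gradient-descent player on a one-dimensional unconstrained linear game looks like the sketch below. This is a standard baseline for comparison, not the minimax-optimal strategy analyzed in the paper, and the step size `eta` is an illustrative value.

```python
import numpy as np

def ogd_unconstrained(grads, eta=0.1):
    """Online gradient descent on linear losses g_t * x_t, starting at x = 0.

    A standard baseline player, not the paper's minimax-optimal strategy.
    """
    x, plays = 0.0, []
    for g in grads:
        plays.append(x)      # play the current point
        x -= eta * g         # then update on the revealed linear loss
    return np.array(plays)

rng = np.random.default_rng(1)
g = rng.choice([-1.0, 1.0], size=1000)   # adversarial-style gradient signs
plays = ogd_unconstrained(g)
player_loss = float(np.sum(g * plays))
# Regret against a fixed comparator u is sum_t g_t x_t - u * sum_t g_t,
# which is unbounded over u for linear losses; this is why the paper
# benchmarks against a function of the comparator instead of a fixed set.
print(player_loss)
```

The unboundedness of the fixed-comparator regret is what motivates the broader family of benchmark functions considered in the abstract.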